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ABSTRACT 

We write the correlation function of dark matter particles, £(r), as the sum of two 
terms — one which accounts for nonlinear evolution, and dominates on small scales, 
and another which is essentially the term from linear theory, and dominates on large 
scales. We use models of the number and spatial distribution of haloes and halo density 
profiles to describe the nonlinear term and its evolution. The result provides a good 
description of the evolution of £(r) in simulations. We then use this decomposition 
to provide simple and accurate models of how the single particle velocity dispersion 
evolves with time, and how the first and second moments of the pairwise velocity 
distribution depend on scale. The key idea is to use the simple physics of linear theory 
on large scales, the simple physics of the virial theorem on small scales, and our model 
for the correlation function to tell us how to weight the two types of contributions 
(linear and nonlinear) to the pairwise velocity statistics. When incorporated into the 
streaming model, our results will allow a simple accurate description of redshift-space 
distortions over the entire range of linear to highly nonlinear regimes. 

Key words: galaxies: clustering - cosmology: theory - dark matter. 
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1 INTRODUCTION 

Strong constraints on models of large scale structure follow 
from combining statistics of the density field with statistics 
of the velocity field. In this paper, we show how the number 
of particle pairs depends on pair separation, and use this 
to compute the mean and mean square pairwise velocity. 
That is, we show how the correlation function of the density 
field and the distribution of pairwise velocities can all be 
computed from the same model. 

The key to being able to do this is a simple model of how 
and why the correlation function £(r, a) evolves with time. 
The evolution of £ was first accurately modelled by Hamil- 
ton et al. (1991). Following Peebles (1980), they showed that 
knowledge of how the correlation function evolved allowed 
them to describe how the mean streaming velocity ui2(r, a) 
of particle pairs evolved as well (also see Nityananda & Pad- 
manabhan 1994). Peebles suggests that the second moment 
of the pairwise distribution, o\2 (r, a) depends on (an integral 
over) the three-point correlation function (^(r,a); because 
there is no general description of £ from linear to non-linear 
scales (but see Scoccimarro & Frieman 1998 for scale-free ini- 
tial conditions), there is, at present, no simple description of 



how the pairwise velocity dispersion depends on scale (see, 
e.g., Mo, Jing & Borner 1997; Jing, Mo & Borner 1998). 

In this paper, we will follow a different approach than 
the one laid down by Hamilton et al. The logic behind our 
approach follows from the models first discussed by Ney- 
man & Scott (1959), and references therein. In these models, 
all particles are assumed to be in collapsed haloes, and the 
correlation function of the particles depends on the density 
profiles as well as on the spatial distribution of the parent 
haloes. What has changed since those early days is that we 
now understand that, for statistics like the correlation func- 
tion, the most important parameter of a halo is its mass. 

Following Sheth & Saslaw (1994), Sheth & Jain (1997) 
showed that they were able to provide a good description 
of the dark matter correlation function on small scales even 
though they neglected the fact that the parent haloes are 
clustered. This works because most close pairs are actu- 
ally in the same halo, and so the correlation function de- 
pends only on the distribution of particles within haloes 
(the halo density profile) and not on the spatial distribution 
of the other haloes. Using formalism presented in McClel- 
land & Silk (1977), they showed how to write the correla- 
tion function as an integral over haloes having a range of 
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masses. They then used simple analytic approximations for 
n(m), the number density of haloes (the mass function for- 
mula of Press & Schechter 1974), and halo density profiles 
(power- laws with slopes chosen to agree with simulations), 
to present their results. 

The results of e.g., Scherrer & Bertschinger (1991), show 
that one really expects the power-spectrum to be the sum 
Plhalo(fc) + P2haio(fc), where P 2h aio(fc) is the contribution 
from pairs in different haloes (i.e., the term that was ne- 
glected by Sheth & Jain). More recently, Seljak (2000) and 
Peacock & Smith (2000) have extended the model calcu- 
lation to include these terms. They used slightly more ac- 
curate fitting formulae for the inputs (the mass function is 
from Sheth & Tormen 1999 and the halo profiles are from the 
work of Navarro, Frenk and White 1997). Their results show 
that a good approximation to the evolved P(k) on all scales 
can be got by simply setting P n i(fc) = Pihalo(fc) + Po(fc), 
where the first term is that due to the halo profiles and 
mass function, and the second is the initial power-spectrum 
evolved to the present time using linear theory. 

Section of this paper studies the two-point correlation 
function, £(»*), rather than its Fourier transform, P(fc). Of 
course, in this case also, £(r) = CiimioM + 6maio(r), but 
we feel that explicitly working in real space shows the sorts 
of approximations which lead to this decomposition more 
clearly. It also shows why, both in real and in Fourier space, 
the 2-halo term should approximately equal the linear theory 
expression. 

Since this is a model in which the contributions to £(r) 
are written as functions of halo mass, we can study the 
Layzer-Irvine cosmic energy equation as a function of halo 
mass. We show this in Section 3. Our analysis shows which 
terms in the energy equation come from nonlinear virial mo- 
tions within each halo, and which from the motions of haloes 
as a whole which, following Sheth & Diaferio (2000), are 
more in line with what the linear theory would predict. We 
show how our decomposition allows us to provide a simple 
estimate of the single particle velocity dispersion — which can 
be thought of as the density weighted temperature. 

The requirement of pair conservation provides a relation 
between the correlation function £(r) and the first moment 
of the pairwise velocity distribution vi2(r). In Section we 
use our decomposition of £ to provide a simple expression 
for the mean streaming velocity, V12, and then show that 
this expression describes measurements in simulations quite 
well. 

Peebles (1980) shows that the second moment, <Ti2(r), 
is related to (an integral over) the three-point correlation 
function, (. This relation between the pairwise dispersion 
and £ is often called the cosmic virial theorem. Since our 
halo-based approach allows us to model £ as well (Scocci- 
marro et al. 2000), we could, in principle, use this to study 
how ai2(r) depends on pair separation. In Section H we de- 
scribe what we think is a much simpler way to think about 
and model (712(f). In our approach, the important ingredi- 
ent is not £, but knowledge of how velocities are correlated. 
We first show that neglecting these correlations is a rather 
good approximation: our models of £ and the single particle 
velocity dispersion are sufficient for describing the main fea- 
tures of <Ti2- We then describe a simple model for including 
the effects of velocity correlations. 

Knowledge of both the mean and the dispersion of pair- 



wise velocities are useful for modelling redshift space distor- 
tions, which we plan to present elsewhere. The three veloc- 
ities we study here, the single particle velocity dispersion 
(v 2 ), the mean streaming velocity «i2(?"), and the pairwise 
dispersion (712(f) ma y a U be used to provide estimates of 
the density of the Universe, although Jenkins et al. (1998) 
discuss why, in cluster normalized CDM models, different 
cosmological models may have rather similar values of V12 
and (7i2. 



2 THE CORRELATION FUNCTION 

Let p(r\m) denote the shape of the density profile of a halo 
which contains mass m; the mass in a spherical shell at dis- 
tance r from the centre of the halo is iirr 2 p(r\m) dr. Let 
X(r\m) denote the convolution of such a profile with an- 
other of exactly the same shape. For spherically symmetric 
density profiles 



X(r\rn) = 2ty / da;i x 1 p(xi\m) / d/3 p(x2\m) 



(1) 



where x\ — x\ + r 2 — 2xir(3. The contribution to the corre- 
lation function from pairs in which both particles are in the 
same halo is given by weighting the convolution profile of a 
halo of mass m by the number of haloes of mass m, and in- 
tegrating over m (Sheth & Jain 1997). The total correlation 
function is the sum of this plus a term which arises from 
pairs which are in two different haloes. This means that the 
second term is a convolution of the profiles of each of the 
two haloes involved, with the halo-halo correlation function: 



A(r|mi,m2) = 



d 3 ri 



d r 2 p(ri\mi)p(r 2 \m2) 



6*(l 



n 



- r 2 + r\ 



mi, 7712 



(2) 



Suppose that the halo-halo correlation function changes 
slowly on separations which are large compared to the typi- 
cal size of a halo. Then, at large r, the halo-halo correlation 
function can be taken outside the integrals, leaving just the 
convolutions over the profiles. But these each contribute a 
factor which equals the mass of the halo, since any position 
within the halo gives approximately the same pair separa- 
tion. This means that 



A(r|mi, m.2) ~ mi ni2 £,hh(r\mi,iri2) 



(3) 



at large separations. 

To proceed, we need a model for £hh(r| m i> m 2)- This 
has been done by Mo & White (1996) and Sheth & Lem- 
son (1999). On large scales, Mo & White argued that the 
correlation function of haloes of mass m should simply be 
a constant times the correlation function of the dark mat- 
ter, £hh(f) — b 2 (m) £(r), and that the value of the constant 
should depend on the halo mass. Sheth & Tormen (1999) 
showed that this dependence on mass can be derived from 
the shape of the mass function n(m), which, in turn, depends 
on the initial shape of the power spectrum. In addition, on 
large scales, linear theory should apply, and so £(r) (a (,o{r), 
where £o(r) is the linear theory correlation function of the 
dark matter. Thus, for large separations, 



6ih(r|mi,m 2 ) « 6(mi)6(m 2 ) £o(r). 



(4) 
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On small scales the halo-halo correlation function must 
eventually turn over (haloes are spatially exclusive — so each 
halo is like a small hard sphere). So setting £hh{r\mi, 7772) w 
6(mi) 6(7712) £(»") will almost surely overestimate the true 
value. Using the linear, rather than the nonlinear correla- 
tion function, even on small scales, is a crude but convenient 
way of accounting for this overestimate. (Although the re- 
sults of Sheth & Lemson 1999 allow one to account for this 
more precisely, it turns out that great accuracy is not re- 
ally needed since, on small scales, the correlation function is 
determined almost entirely by the one-halo term anyway.) 

If we allow a range in halo masses then the total corre- 
lation function is 

£(r) = Cihaio(r) + ^2haao(r-) 

77(777) X(r\m) 

dm _ ' _ ' + 



dmi 



P P 

77(7711) 



n(m2) . , . ._. 

d?772 — = — A(r\mi,m,2), (5) 



where 77(777) dm denotes the number density of haloes which 
have mass in the range dm about 777, and p is the average 
density. The first term dominates on small scales, and the 
second term dominates on large separations. Now, on large 
scales A(r|mi, 777,2) is well approximated by the product of 
777i 6(7711), 7772 6(7772) and £n(r). We will not do too badly if we 
continue to use this approximation on smaller scales because 
the scales on which it breaks down are precisely those on 
which the first term begins to dominate. This means that 
the second term can be written as the product of two one 
dimensional integrals. Moreover, the bias factors are defined 
so that 



m 1 n(m 1 ) 
ami - 6(7771) 



(6) 



Therefore, the second term really is very simple: to a good 
approximation, 



ew 



77(777) \(r\m) , , , 
dm-^ v 1 ' +go(r). 



(7) 



Note that, in this approximation, the second term is the 
same for all halo profiles — different profile shapes yield dif- 
ferent shapes for A(r|?77) and so result in different correlation 
functions on small scales only; in principle, one can use mea- 
surements of the shape of the correlation function at small 
separations to constrain the shapes of profiles. 

To illustrate how the profile affects the correlation func- 
tion, in what follows we will use the NFW profile of Navarro, 
Frenk & White (1997). In Appendix A we provide expres- 
sions for X(r\m) for the NFW profile truncated at the virial 
radius, the Hernquist profile (1990), and the singular isother- 
mal sphere, truncated at the virial radius so that it has finite 
mass. These three density profiles all have the property that 
\(r\m) is never less than zero. As a result, £ihaio is also 
never less than zero. This means that the integral of £ihaio 
over all separations does not equal zero. As a result, equa- 
tion (H) above does not satisfy the integral constraint. This 
is a formal feature of the models to which we will return 
later. 

The truncated NFW profile above has two free parame- 
ters, a core radius and an average density. Navarro, Frenk & 
White (1997) showed that, in their simulations, the core ra- 
dius depends on the mass of the halo, whereas all haloes have 




0.1 1 10 

r [Mpc//i] 



0.1 1 10 

r [Mpc/ft] 



Figure 1. The correlation function of dark matter particles. 
Symbols show the fitting formula provided by Peacock & Dodds 
(1996). Solid curves show the correlation associated with NFW 
profiles. The two dashed curves show the contribution from pairs 
in the same halo (dominates at small r), and pairs in different 
haloes (dominates at large r). Dotted curve shows the linear the- 
ory correlation function £o ; on the large scales where the two-halo 
term dominates, it is very well approximated by the linear theory 
function. 



the same average density, whatever their mass. They found 
that the core radii of massive haloes are at larger fractions 
of their virial radii than for less massive haloes; less mas- 
sive haloes are more centrally concentrated. The ratio of the 
core radius to the virial radius increases with mass in a way 
which depends on the shape of the initial power-spectrum — 
they discuss in detail why this is so. In what follows, we will 
use the simple analytic approximation to this relation given 
in Scoccimarro et al. (2000) as we integrate A(r|777) over the 
mass function: aNFw(TTi) = (m/?77„) ' 13 /9, where 777* is the 
average mass contained in a tophat filter whose scale is set 
by the requirement that the rms value of the initial density 
fluctuation field smoothed with the filter, extrapolated to 
the present using linear theory, is S c « 1.68. 

Fig. hi shows the correlation functions one obtains by in- 
serting the core-radius relation given above into our expres- 
sions for X(r\m), and integrating over the mass function. 
The two panels show two variants of the CDM family of 
initial conditions: our SCDM model has Qo — 1, ft — 0.5 
and o"8 = 0.5, whereas our ACDM model has flo = 1, 
Ao = 1 — fio, h — 0.7 and as = 0.9. The symbols show the 
fitting formula of Peacock & Dodds (1996) which describes 
the correlation functions from numerical simulations of these 
cosmological models well (see, e.g., Mo, Jing & Borner 1997; 
Jenkins et al. 1998) , and the solid curve shows our model us- 
ing NFW profiles. The two dashed curves, which sum to give 
the solid curve, show the two contributions to the correlation 
function; the curve which dominates on small scales shows 
the contribution from pairs in the same halo — the first term 
in equation (ra). The curve which dominates on large scales 
shows the contribution from pairs in different haloes. The 
dotted curve shows the linear theory correlation function; 
on the scales where the two-halo term dominates, the linear 
theory function provides an excellent approximation. The 
model provides a good description of the correlation func- 
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tion of the dark matter. Although we have only presented 
results for z — 0, the model is also accurate at earlier times. 



3 THE COSMIC ENERGY EQUATION 

The cosmic energy equation describes how the energy of the 
Universe is partitioned between kinetic and potential en- 
ergy. In essence, it provides a relation between the correla- 
tion function of dark matter particles and their rms speeds. 
Following Hamilton et al. (1991), we know how to compute 
the shape of the correlation function (i.e., how it depends on 
separation r) at any time, if its initial shape is known (also 
see Nityananda & Padmanabhan 1994). Mo, Jing & Borner 
(1997) showed that inserting this evolution into the cosmic 
energy equation provided a very good description of how the 
rms velocities of dark matter particles in their simulations 
evolved. How much of the evolution of the rms velocity is 
driven by nonlinear effects within virialized clusters, which 
produce large velocities, and how much contains information 
about the (linearly evolved) initial conditions? We show be- 
low that the model described above allows us to separate 
out the linear from the nonlinear effects. 

The cosmic energy equation (Irvine 1961, 1965; Layzer 
1963, 1964) is an exact statement of what energy conserva- 
tion means in an expanding universe. It provides a relation 
between the potential energy of the system, 



W = Wlinear + W no 



(11) 



W(a) = ~2TvGp(a) / dr r£(r,a), 



(8) 



and the kinetic energy K(a), which is essentially one half 
times the mean square velocity of the particles in the system. 
(Note that p oc a~ 3 , where a denotes the scale factor of the 
Universe, and not the core radius of the halo profile!) If the 
evolution of W is known, then the cosmic energy equation 
allows one to compute the evolution of K also. Since we 
know how £(r, a) evolves, we know how W evolves, so we 
can compute K(a) also. Mo, Jing & Borner (1997) showed 
that by combining the Hamilton et al. (1991) evolution of 
£(r, o) with the cosmic energy equation they were able to 
compute a good approximation to the value of the single 
particle velocity dispersion at any given time. In particular, 
by integrating the energy equation once, Mo et al. (1997) 
showed that 



(v (a)) = - n(a)H 2 (a) a 2 1 (a) 1 - 



I(a ) da 

1(a) a 



(9) 



where W (a) = — 2ivGp(a) a 1(a). Although this expression 
is exact, Mo et al. also showed that the simpler expression 
which follows if 1(a) oc D(a) 2 , as it would in linear theory, 
is a good approximation: In this case 



(v (a)) « -Q(a)H 2 (a)a 1(a) 



D 2 (a')da' 
aD 2 (a) 



(10) 



Davis, Miller & White (1997) were interested in separat- 
ing out those contributions to the velocity dispersion which 
arise from nonlinear effects from those which are given by 
linear theory. Since we know how to write £(r,a) as a sum 
of two terms, one from linear theory and the other nonlin- 
ear, we can begin to address some of the issues they raised. 
Specifically, suppose we write 



and we require that it equals the expression for W above. 
Linear theory, extrapolated to the present time, would have 



W(a) = ~2TvGp(a) / dr r£ (r) 



(12) 



where £o denotes the linear correlation function (e.g. Pee- 
bles 1980). However, our model for the nonlinear correlation 
function is to write £(r) as a sum of this linear part, plus a 
term which depends on halo profiles. Thus, we find that 



^nonlinear = -2ttG / dm 



n(m) 



dr r X(r\m) 



(13) 



where we have rearranged the order of the integrals so 
that we do do the one over r before integrating over m. 
The resulting integrals over r are computed explicitly in 
the Appendix. In particular, for the halo models of in- 
terest in this paper, the haloes are in virial equilibrium: 
Wnoniinear = — 2A~ n0 nimoar • Thus, in our approach, the ra- 
tio of the nonlinear to the linear theory term depends on 
the mass function and the density profiles of dark matter 
haloes. For example, for NFW haloes, — H^noniinoar equals 
J dm [n(m)/p] (Grn 2 /r v ir) times a constant which depends 
on the ratio of the core radius to the virial radius. 

We will not explore this further in this paper. For the 
time being, we will simply use this expression to estimate 
how the single particle velocity dispersion evolves. Namely, 
we will use linear theory extrapolated to the present time to 
estimate Ilnnear (see, e.g., Peebles 1980) and our models of 
halo profiles to estimate the other contribution to IFnoniincar, 
and we will then use the energy equation to derive (v 2 ). We 
will need this single particle {v 2 } later on, when we study 
how the pairwise velocity dispersion depends on pair sepa- 
ration. For the CDM models presented in this paper, this 
gives (w 2 )scdm = 675km/s and (w 2 )a / c DM = 590 km/s, in 
good agreement with the values measured in the simulations 
(e.g. Mo, Jing & Borner 1997; Sheth & Diaferio 2000). Be- 
cause our models allow us to compute this single particle 
dispersion at any times, they provide a simple way of com- 
puting the evolution of the density weighted temperature of 
the Universe. 



4 THE MEAN STREAMING VELOCITY 

The scale dependence of the mean streaming viz (r) of dark 
matter particles has been understood for some time now. 
Hamilton et al. (1991) showed that because they could pro- 
vide good estimates of the evolution of £(r,a), for any ini- 
tial correlation function, they could also describe the shape 
of v\2(r) (also see Nityananda & Padmanabhan 1994). Be- 
cause the halo model presented in the previous section al- 
lows one to compute £(r,a), by following the steps outlined 
by Hamilton et al., it can also be used to compute v\i(r). 
We will not do this here. Rather, we will show that because 
our halo model allows one to split the correlation function 
up into linear and nonlinear parts, we are able to compute a 
good approximation to Vi2(r) rather more simply. Recently, 
Juszkiewicz, Springel & Durrer (1999) have presented a fit- 
ting formula for this statistic; they argue that their formula 
is simpler to use than the exact method of Hamilton et al. 
The results presented in this section can be thought of as 
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providing a simple physical reason for the values of the co- 
efficients in their fitting formula. 

The relevant starting point is the pair conservation 
equation in Peebles' book (Peebles 1980): 



8(1+0 v 12 (r) 



do 



Hr 



l + £(r) 



(14) 



where £(r, a) is the volume averaged correlation function on 
comoving scale x — r/a at the time when the expansion 
factor is a, and the Hubble constant is H. Note that the 
partial derivative with respect to a on the left hand side 
keeps x fixed rather than r. This says that if we know the 
correlation function for all scales x and all times a, then we 
can compute how «i2 (r) depends on scale today; basically it 
comes from assuming that the number of pairs is conserved. 
Hamilton et al. also showed that by inserting their expres- 
sion for the evolution of £(r,a) into equation ([14]) above, 
they were able to describe U12 (r) accurately. 

A simpler analytic approach follows from reading 
through Peebles' logic further. He notes that an approx- 
imate solution to equation ( |l4| ) can be got by assum- 
ing that £ evolves according to linear theory: £(r = 
ax, a) — [D(a)/D(ao)] 2 £(r — aox,ao), where D(a) is 
the linear theory growth factor. Then the left hand 
side is ad^(ax,a)/da — 2f(Q,)£(ax,a), where /(fl) = 
d\nD/d\na ~ fi 0,6 . So, in this approximation we get 



ma _ 2 /(n)£(r,a) 
Hr 3 l+£(r,a) 



(15) 



This is just the usual linear theory expression with an extra 
factor of (l+£) in the denominator. Juszkiewicz et al. (1999) 
show that, while this approximation is fine on large scales, 
it underestimates the exact solution by a factor of 3/2 or so 
on smaller scales. They use perturbation theory to motivate 
the introduction of the extra terms they must add to this 
expression to rectify this problem. 

Our model provides another simple way to see what 
these terms should be. Since the previous section allows us 
to write £ as a sum of two terms, we can work out how 
each one scales with time. In our model, the term which 
dominates on small scales evolves both because the mass 
function evolves, and because the concentrations of haloes of 
a fixed mass depend on when they formed. The term which 
dominates on larger scales is very similar to that predicted 
by linear theory. If we assume that it evolves according to 
linear theory, then 



«12 _ 1 

Hr ~ 3[l+£(r,a) 



2/(n)&haio(r,a) + 



0fr 



<91na 



(16) 



where the derivative of £ihaio with respect to expansion fac- 
tor a can be evaluated because the dependence of the mass 
function n(m, a) and the halo profiles and their convolutions 
\(r\m) on a are all known. Thus, we have 



d£i 



<91nm* 



<91na 



<91na 






£ihaio{r,a) - 

dr r 
o Jo 

X(r\m) dlnX <91nc 



£lhalo(f, o) 

SO / \ 

dm^M 



d lnc d lna 



(17) 



halo profile (so cnpw = 1/gnfw) to avoid confusing the 
core-radius relation with the cosmological expansion factor. 
The notation 91nc/91na| m , denotes a derivative with re- 
spect to lna keeping m/m* fixed. If the time dependence 
of c comes only from its dependence on m*, then the final 
term on the right hand side vanishes, making the expression 
particularly simple. If, in addition, the correlation function 
was a pure power-law of slope, say, 7, then £ = (3 — 7)£/3. 
In this case, the term in square brackets in the expression 
above would become (7/3) £ihaio- 

To illustrate how the one-halo term scales, it is conve- 
nient to study a spectrum with shape P(k) oc k n initially. 
In this case <91nm*/<91na = /(Q) 6/(3 + n). On very small 
scales, £ ~ Cihaio is expected to be a power law of slope 
7SC « 3(3 + n)/(5 + n) (e.g. Peebles 1980). This makes the 
right hand side of equation ( |l7|) equal to /(fi) 6/(5 + n) 
times fihalo' This is the same as the scaling required by sta- 
ble clustering (e.g. Hamilton et al. 1991). On intermediate 
scales, Padmanabhan (1996) argues that £ should be ap- 
proximately a power-law of slope 7ql ~ 3(3 + n)/(4 + n). 
If we still set 1 + £ ~ £ihalo, then the right hand side of 
equation ( |l7| ) becomes f(£T) 6/(4 + n) times fihaio- This is 
precisely the quasi-linear scaling assumed by Padmanabhan 
(1996). On large scales £ has slope 7l = (3 + n), and so the 
right hand side of equation (|l7|) becomes 2 /(fi) fihalo, which 
is the same as the scaling required by linear theory. Thus, 
our halo profile term (equation |l7|) interpolates smoothly be- 
tween these different regimes. Of course, because £ihaio(»") is 
not really a power law, it never actually obeys these scalings 
exactly. Hamilton et al. (1991) assumed that D12 could be 
written as a function of £ alone. Because our one-halo term 
actually depends both on £ihaio as well as on £ihaloj our halo 
models are formally inconsistent with the Hamilton et al. 
ansatz. For example, the ansatz is based on an assumption 
that clustering on small scales is stable, whereas our halo 
models are not. Ma & Fry (2000) explore some consequences 
of this. 

Fig. B shows that our model, 



Via 
Hr 



m) 



3[l+£(r,a)] 



2*2halo(r,a) + 



3 + n* 



£ihaio (r, o) - Cihaio {r,a) 



(18) 



Here c denotes the inverse of the core radius relation of the 



where n* = —1.33,-1.53 is the slope of the power spec- 
trum on the scale ra* for the SCDM and ACDM models 
we consider in this paper, is quite accurate. The triangles 
show measurements from the publically available Virgo sim- 
ulations (Jenkins et al. 1998), and crosses show the fitting 
formula which Juszkiewicz et al. (1999) obtained by fitting 
to these simulations. The open circles in the panel on the 
right show the streaming motions in the ACDM GIF simu- 
lation (e.g. Kauffmann et al. 1999). The GIF SCDM box is 
only 85 Mpc//i on a side. As a result, the velocities in it are 
strongly affected by the finite size of the box (Sheth & Di- 
aferio 2000), which is why we have not shown V\2 from this 
simulation. The solid curve shows our model (equation |l6| ) 
which accounts for the fact that evolution on small scales is 
nonlinear, and weights by the relative fractions of linear and 
nonlinear pairs. It is the sum of two terms; these are shown 
as the two dashed curves. Using the linear theory correla- 
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Figure 2. The mean streaming velocity of dark matter parti- 
cles. Triangles show the Virgo simulation measurements, circles 
show the GIF ACDM simulation, and dot-dashed curves show the 
Hubble expansion velocity. Crosses show the result of using the 
Peacock & Dodds (1996) formulae for the correlation function in 
the fitting formula provided by Juszkiewicz et al. (1999). Solid 
curves show the model described in the text which accounts for 
the fact that the nonlinear evolution is different from what linear 
theory predicts, and then weights the linear and nonlinear scal- 
ings by the relative fractions of linear and nonlinear pairs. Dashed 
curves show the two contributions to the streaming motion in our 
model; the curves which peak at large r are for pairs in two dif- 
ferent haloes. Dotted curve shows the approximation of using the 
linear theory correlation function to model this two-halo term. 
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Figure 3. The ratio of the mean streaming velocity of dark mat- 
ter particle pairs separated by r, to the Hubble expansion on that 
scale. As in the previous Figure, triangles show the Virgo simu- 
lations, circles show the ACDM GIF simulation, crosses show the 
Juszkiewicz et al. (1999) fitting formula, and solid curves, which 
are the sum of the dashed curves, show our model predictions. 



tion function (the dotted curve in the previous figure) to 
model the contribution from the two-halo term (the dashed 
curve which dominated on large scales in the previous fig- 
ure) corresponds to setting ^2haio — ► Co- This approximation 
is shown as the dotted curves. The dot-dashed curves show 
the Hubble expansion velocity for comparison. 

To study the approach to stable clustering, it is more 
useful to show the ratio of the streaming velocity to the 



Hubble expansion. Recall that one might have expected the 
smallest scales to have —Vvx /Hr = 1. This follows from the 
simple scalings we discussed earlier (assuming flo = 1): set 
Cihaio = 3£ihaio/(3-7sc), and use the fact that l+£ « £ihaio- 
Fig. kfl shows that on small scales the mean streaming mo- 
tions in our halo models are smaller than the stable cluster- 
ing limit. The streaming motions do not cancel the Hubble 
expansion. As before, the triangles in the two panels show 
the Virgo simulations, the crosses show the fitting formula 
to these simulations, and circles in the panel on the right 
show the streaming motions in the ACDM GIF simulation. 
Although our model v\i curves (solid lines) fall below the 
stable clustering limit, they are in quite good agreement 
with the simulations. 

Notice that, on scales slightly smaller than a Mpc/fa or 
so, the one-halo term in our models can exceed the Hubble 
velocity. This shows explicitly that, even if clustering were 
to approach the stable clustering limit, it would have to do 
so on scales which are smaller than the virial radii of haloes. 
This is consistent with arguments in Sheth & Jain (1997). 
Another way of saying this is that, if — v\i equalled Hr out 
to the virial radii of all haloes, then the one-halo contribu- 
tion to the mean streaming velocity would be fihalo/(l + : 
this would give a curve which decreased monotonically from 
unity as r increased, whereas our one-halo term actually has 
a peak at ~ lMpc//i. On these slightly larger scales, writ- 
ing our previous scalings for power-law profiles in terms of 
£ with slope 7ql shows that —v\2JHr = 2f(Q,). This value 
is in reasonable agreement with the height of the peak of 
the solid curve in both panels of Fig. pi Higher resolution 
simulations are required to determine whether clustering is 
stable on scales smaller than about 0.1Mpc/ft. 

The careful reader will have noticed that our halo mod- 
els slightly overestimate the amplitude of the streaming mo- 
tions on large scales. Some of this effect may be due to our 
simple treatment of ^2haio • If we were to set ^2haio — > f o we 
would overestimate the true value even more (the dotted 
curves are always above the dashed ones). However, notice 
that the single-halo contribution to W12 is nonzero even at 
separations larger than 10Mpc/ft, and that it approximately 
accounts for the overestimate on these scales. Where does 
this large scale single-halo contribution come from? It does 
not arise from particles which are falling towards each other 
from opposite sides of a few proud monster haloes! (A viri- 
alized halo with a radius of 5 Mpc/h would have a mass of 
about 2.5f2 x 10 16 M@//i.) Rather, at least some of the over- 
estimate of V12 arises from the fact that, formally, the halo 
models violate the integral constraint. As mentioned earlier, 
the integral of the halo model correlation function over all 
separations does not equal zero. This means that the model 
overpredicts the value of the true fihaio- A glance at Fig. hi 
shows that £ihak> on the scale of 5 Mpc//i is negligible. This 
in equation ( h_7| ) shows that, on these scales, the single-halo 
contribution to W12 is determined almost entirely by the vol- 
ume average term, and this results in an overestimate. 

One way of remedying this is to make the halo profiles 
compensated, say, by embedding them in slight underden- 
sities. This would have the effect of making the large scale 
value of £ihaio go slightly negative, with a more dramatic ef- 
fect on large scale values of the volume average £ihak>- While 
compensated profiles may be physically reasonable, and are 
certainly of formal interest, we have not pursued this fur- 
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ther, because the large scales where the spurious streaming 
velocities appear are also those where the two-halo term 
dominates the pair statistics. 

Before we move on to the second moment of the pairwise 
velocity distribution, we think it is useful to rewrite our 
expression for the mean streaming motions one final time. 
If we use the fact that ^haio ~ £o, then 



^12 

Hr 



Hr l + £(r) 



+ 



a/(n) 

3 + n* 



ClhaloM 



£lhalo(»") 



- 1 



Clhalo(r) 

l + *(r)' 



(19) 



where we have omitted writing factors of a throughout for 
brevity, and we have defined — v\™ / Hr = 2fo/3[l + Co]) 
which can be written entirely in terms of linear theory quan- 
tities. The factors £ihaio/[l+£] and [l + £o]/[l + £] are simply 
the fractions of pairs from particles in the same halo, and in 
separate haloes, respectively. This form shows clearly that 
the streaming motions arise from applying linear theory to 
the pairs in separate haloes, nonlinear theory to the pairs 
in the same halo, and weighting by the fraction of pairs of 
each type. For instance, stable clustering would yield unity 
times the nonlinear pair-weight term, and the quasi-linear 
scaling described above would yield two times the nonlinear 
pair-weight term. It is this sort of decomposition into linear 
and nonlinear parts which we will exploit in what follows. 



5 THE PAIRWISE VELOCITY DISPERSION 

This section provides a simple model of how and why the 
pairwise velocity distribution depends on scale. The model 
constructed here is a natural generalization of that stud- 
ied by Sheth (1996), Diaferio & Geller (1996), and Sheth & 
Diaferio (2000). It is much simpler than the exact 'cosmic 
virial theorem' approach (Peebles 1980; Bartlett & Blan- 
chard 1995; Mo, Jing & Borner 1997) one is led to if one 
attempts to climb the rungs of the BBGKY hierarchy, one- 
by-one. 



5.1 The model assumptions 

Let u(r\2) denote the difference between the velocities of two 
particles separated by ria, along their line of separation. If 
the velocities of the particles separated by ria are indepen- 
dent of each other, then the shape of this distribution can be 
computed from knowledge of the shape of the single particle 
distribution function directly. On small scales, this assump- 
tion of independence is almost certainly wrong. However, in 
the small separation limit, progress can be made by assum- 
ing that all pairs at a given small ria are in the same halo. 
Since haloes are virialized, particle velocities within a halo 
are drawn from independent Maxwellian distributions, and 
so velocity differences are also Maxwellian, albeit with twice 
the dispersion of the single particle case. The dispersion of 
the Maxwellian depends on the mass of the parent halo in 
which the pair is, and so the full distribution of u(rvi) can 
be computed by integrating over the distribution of halo 
masses, weighting by the number of pairs which have sepa- 
rations ria within each halo of mass m. This model is studied 
in detail in Sheth (1996) and Diaferio & Geller (1996). Of 



course, their model only applies on scales where, for most 
pairs, both members are in the same halo. 

What happens at larger separations? In this subsection 
we will study the limit in which both members of the pair 
are in different haloes. We will argue that, in this limit, the 
distribution of u is also relatively simple. 

As for the correlation function, we begin by writing the 
dispersion a 2 2 (r) = (u 2 (r)) at separation r as the sum of 
two terms: 



C12 (r) = o-i ha i (r) + a 2ha , io (r), 



(20) 



where the first term arises from pairs in which both mem- 
bers are in the same halo (so it depends on properties of 
virialized haloes and dominates on small scales), and the 
second term is for pairs in different haloes (so it dominates 
on large scales). 

As mentioned above, the first term depends on the den- 
sity profiles of virialized haloes. There are two reasons why 
it is a function of scale. First, the velocity dispersion within 
a halo may depend on position within it, so the pairwise 
dispersion will depend on separation. However, the pairwise 
dispersion will depend on scale even if we neglect this de- 
pendence. To see why, note that we should get a reasonable 
estimate of this term by setting the dispersion equal to the 
circular velocity at the outside edge of the halo: Gm/rvir 
(this would be exact for an isothermal sphere, but is only 
approximate otherwise), and using this for all r < r v i r . In 
this case, 



cr lhalo( r ) — 



l+*(r) 



dm 



Gm n(m) \(r\m) 



(21) 



This expression is exactly like the one in Sheth (1996), ex- 
cept that we have multiplied it by an additional factor of 
A(r|m)/[l + £(r)] to account for the fact that only a fraction 
of the pairs at a given separation r are in the same m-halo. 

To see that this is the correct pair weighting factor, 
let (7(1, 2) d«id«2 denote the probability that there exists a 
particle in a cell of size dvi within a halo of mass m, and 
another particle in a cell of size dvi which is not necessar- 
ily in the same halo. Similarly, let P(l, 2|m) duidu2 denote 
the probability that there exists a particle in a cell of size 
dv\ within a halo of mass m, and that the other particle 
in dv2 is within the same halo. If each particle carries a 
mass m p , then U — (p/m p )dvi(p/m p )[l + £(r)]du2, and 
P — mn(m)dmdvi/m p [A(r|m)/m] dv2/m p . The ratio is the 
fraction of pairs at separation r which are both in the same 
m-halo: P/U = dmn(m)A(r|m)/(p/m p ) 2 [l +£(r)]. This is 
the weighting we used in the equation above. Note that this 
is consistent with equation (Tij): one plus the left hand side of 
that expression is the total number of pairs at separation r, 
and the first term on the right hand side is the contribution 
from pairs in which both particles were in the same halo. It 
is also the weighting associated with our final expression for 
the mean streaming motions (equation [19) . 

Allowing the dispersion to depend on position in the 
halo means that 



Gm 



A(r[m) 



2ty / dii Xi p(xi\rn) / d/3 p(x2\rn) 



where x\ = x\ + r 2 



a\ d {xi\m) + ai A {xi\m) , (22) 



- 2xir/3, and where the one dimensional 
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dispersion, a 2 d , can be approximated by, say, the radial ve- 
locity dispersion given in Appendix A. 

This shows that at small separations, cr 2 halo (r) increases 
with r because virial motions within haloes increase as m 2 ' 3 , 
and only massive haloes can contribute pairs at moderately 
large separations (there will be an additional scale depen- 
dence if we also included the dependence of the dispersion 
on position within the halo). Therefore the pairwise disper- 
sion will increase with increasing scale at small r. On larger 
scales, however, an increasing fraction of pairs are actually 
from different haloes. Since \(r\m) — > as r increases, on 
scales larger than that of a typical halo, crf halo (r) will even- 
tually decrease. 

The term in which particles are in different haloes is 



0"2haloM = 



dmidm; 



1 +£(mi,m2\r) 



min(mi) m2n(m2) 



P P 

where n(m) is the number density of m-haloes, 

£(mi,m 2 |r) = b(mi)b(mi) £o(r) 



S(mi,m 2 |r), (23) 



(24) 



is the correlation function of haloes, b(m) is the linear bias 
factor discussed earlier, £(r) is the correlation function of 
the mass, and S(mi,m2\r) represents the dispersion of the 
velocity difference along the line of separation between par- 
ticles separated by r that are in different haloes (one of mass 
mi and the other m 2 ). Whereas the other terms are simply 
the pair-weighting, the physics is in finding a convenient ex- 
pression for S. 

If a (to) denotes the dispersion associated with a single 
halo, then 



5(mi,m 2 |r) = a (mi) + a (m 2 ) — 2\l/(mi,m 2 |r), 



(25) 



where ^(m\,m2\r) represents the fact that the motion of 
a particle in halo mi may be correlated with that of the 
particle a distance r away in the halo m 2 . Later in this 
paper we will develop a model for the velocity correlation 
function. For the time being, we think it clearer to study a 
simpler case first — in what follows, we will neglect the fact 
that halo velocities may be correlated. 

Before we do so, notice that if haloes were spatially 
uncorrelated as well, then the pairwise distribution would 
be the difference of two random variates, so it would be a 
simple convolution of the single particle distribution derived 
by Sheth & Diaferio (2000). In what follows, we will study 
what happens if we account for the fact that haloes are spa- 
tially correlated, even though we neglect the fact that their 
velocities are also correlated. 



5.2 Neglecting velocity correlations 

Suppose we wish to compute the one-dimensional relative 
pairwise velocity dispersion along the line of separation. 
Since we are ignoring velocity correlations, the shape of the 
pairwise velocity distribution arises from applying the ap- 
propriate pair-weight as a function of separation, and inte- 
grating over the distribution of dispersions given by the halo 
mass function (equation (E3I) with ^ = 0). 

The pair-weighting involves the bias factor which de- 
pends on halo mass and on the shape of n(m); since we are 
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Figure 4. Scale dependence of the pairwise velocity dispersion. 
Triangles show the Virgo simulation results of Jenkins et al. 
(1998), circles show the GIF simulation results, and crosses show 
the fitting formula for the dark matter provided by Mo, Jing & 
Borner (1997). Solid curves show the scale dependence in our 
model which neglects the spatial dependence of the dispersion 
within a halo, and also neglects velocity correlations between 
haloes, but includes the effects of spatial correlations. Dashed 
curves show the contribution from pairs in which both particles 
are in the same halo (hence the peak at small r) and in separate 
haloes (so a peak at large r), respectively. 



using the mass function given in Sheth & Tormen (1999), 
we will use their formula for b(m) also. Let {v 2 } denote the 
single particle velocity dispersion (section H showed that it 
has contributions from the virial motions within haloes as 
well as from the motions of the haloes themselves) . Then the 
bias weighting means that we must compute 2 (bv 2 ). Thus, 
we arrive at a simple expression for the scale dependence of 
this second term: 



o- 2 haio(r) = - (v ) 



l+£ (r){bv 2 )/{v 2 ) 



(26) 



where {v 2 } is the three dimensional dispersion given by the 
cosmic energy equation. The factor of 2/3 arises because 
we are interested in the sum of two (assumed independent) 
velocity variates, and we are only interested in one of the 
three velocity components. 

At large separations f(r) ~ £o(?~), an d they are both 
<§C I, so cr 2halo — > 2(v 2 )/3, as one expects for indepen- 
dent variates. What about intermediate separations? In this 
model, cr| halo (r) depends on scale primarily because £(r) 
does. On intermediate scales (where the assumption that 
the velocities are independent is certainly wrong!), whether 
or not o" 2 haio increases or decreases with scale depends on the 
ratio {bv 2 )/{v 2 }. For example, the weighting by b increases 
the contribution from massive haloes relative to less massive 
ones. Since virial motions within massive haloes generate ve- 
locities that are larger than the rms, for the dark matter, this 
ratio is likely to be larger than unity. This means that er 2halo 
may be slightly higher on intermediate scales than on large 
ones. On small scales, the pair-weighting term £o/(l + C) 
decreases rapidly, so cr 2halo will also decrease. 

Fig. M compares our simple model with what is mea- 
sured in simulations. Circles show the GIF simulation re- 
sults, and triangles show the VIRGO simulation results of 
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Jenkins et al. (1998). Crosses show a fitting function, equa- 
tion (40) of Mo, Jing & Borner (1997), which should de- 
scribe the scale dependence of the pairwise dispersion. In all 
cases, we have chosen to follow standard practice and not 
centre the statistic: to centre, the mean streaming motion 
W12 should be subtracted in quadrature. 

Although the GIF and Virgo simulation results are in 
reasonable agreement on large scales, 0-12 is larger, by 
about 100 km/s, on scales smaller than a Megaparsec or 
so. Presumably, this difference arises from the fact that, al- 
though the two simulations used the same number of parti- 
cles, the boxes had different sizes and the particle masses 
were also different. The Virgo boxes had sides of length 
L = 240Mpc//i, the GIF ACDM box was 141 Mpc/h. The 
finite box size has two effects: first, large scale flows have 
smaller amplitudes in small boxes, and so this affects the 
large-scale value of the pairwise dispersion. In addition, 
small boxes do not have a fair sample of the massive haloes 
which dominate the pairwise velocity statistic at a Mega- 
parsec. Because massive haloes are rare, the pairwise statis- 
tic will have a large scatter; the large value of o\2 we mea- 
sured in the smaller GIF simulation may simply be a large 
fluctuation. On the other hand, because of its better mass 
resolution, the GIF box is able to resolve substructure within 
virialized haloes which the Virgo simulations can not. The 
presence of substructure will increase the velocity dispersion. 
However, because this contribution only adds in quadrature, 
it is not clear that this can account for all the difference. 

We have tried to incorporate the effect of the finite box 
size into our model by restricting our integrals over halo 
masses to the range m < W le MQ/h, and by only integrating 
over k > 2n/L when using the cosmic energy equation to es- 
timate {v 2 ). The dashed curves show the two types of terms 
in our model after setting the box size to L = 141Mpc//i. We 
chose to model the smaller box because the GIF semianalytic 
galaxy formation models of Kauffmann et al. (1999), which 
we will use later in this paper, use this same simulation. The 
dashed curve which peaks at small r is for pairs in which 
both particles are in the same halo, and the dashed curve 
which peaks at large r is for particles in different haloes. 
The solid curve is the sum (in quadrature) of the two con- 
tributions, plus a piece which comes from the fact that the 
statistic is not centred (see below). Our model appears to 
describe the main features of CT12 (r) reasonably well. 

On small scales, discrepancies between the model and 
simulations are not caused by our neglect of the halo velocity 
correlation function; this regime is dominated by pairs which 
are in the same halo, which suggests that it is the assump- 
tion that the pairwise dispersion within haloes is isotropic 
and independent of position within the halo, as it would for 
an isothermal sphere, which should be changed. If one is 
willing to assume the orbits within the halo are isotropic, 
then this can be done by using the expressions for the radial 
velocity dispersion we provide in Appendix A. We will show 
the results of doing this in the next subsection. 

On large scales, the difference between the symbols and 
the solid curve is a measure of the importance of velocity 
correlations. The relatively good agreement on larger scales 
suggests that our neglect of the halo velocity correlation 
function is not a bad approximation. We think it remarkable 
that we are able to provide a reasonably accurate description 



of the rise and fall of the pairwise velocity dispersion without 
once mentioning the three-point correlation function. 

5.3 Including velocity correlations 

This subsection shows how one might include the effects 
of correlated velocities. To do so, we briefly summarize the 
results of Sheth & Diaferio (2000). They argued that in a 
model in which all particles are in haloes, such as the one we 
are studying in this paper, it is sensible to write a particle's 
velocity as the sum of two terms: 



Wvir + Vhalo, 



(27) 



where v v ir is the virial motion of the particle about the 
halo centre of mass, and Uhaio is the motion of the par- 
ent halo. Let a 2 (m) denote the dispersion of particle ve- 
locities in m-haloes. It can be written as the sum of the two 
terms: a 2 iT {m) + cr^ alo (m). Sheth & Diaferio showed that 
a 2 iT oc Gm/r v i r , and that (appropriately smoothed) linear 
peak theory (Bardeen et al. 1986) could be used to esti- 
mate o"haio(m) at any given time rather accurately. Namely, 
Chaio(wi) ~ HQ 06 a-i(m) C(m), where <r_i is computed by 
multiplying P(k) with a smoothing filter W[kR(m)] of scale 
R(m), integrating over k, and dividing by 2ir 2 , and C(m) 
is a correction factor which accounts for the fact that peaks 
have slightly lower rms velocities than random patches (see 
Sheth & Diaferio for the exact expressions). 

In such a model, virial motions are random: the virial 
velocity of a particle is not correlated with the motion of the 
other particles in the halo, nor with the value of i>haio, nor 
with the motion of any other particle in any other halo. This 
means that correlated motions arise because halo motions 
may be correlated, and not otherwise. If the virial velocity 
is independent of pair separation r, then $(mi,m2|r) in 
equation ( Ba ) represents the correlations of halo motions 
for mi and m2-haloes separated by r. Strictly speaking, a 
range of halo separations can contribute to the same particle 
separation r, but we are ignoring that here, just as we did 
when computing the correlation function. 

Since linear theory predicts how the velocity correla- 
tion function depends on smoothing scale (Gorski 1988), and 
since linear theory provides a reasonably good description 
of the rms motions of haloes (Sheth & Diaferio 2000) , it is 
relatively straightforward to include the linear theory cor- 
relations in our model. The final term comes from the fact 
that the statistic is not centred. 

We will approximate the velocity correlation as follows. 
Suppose for the time being that ignoring the peak con- 
straint gave a reasonable approximation to halo velocities. 
Then <Thaio(wi) w HQ 06 a-i(m). Similarly, the correlation in 
velocity between patches of different sizes, say R(mi) and 
R(ni2), along the line of their separation, is 



V>(mi,m 2 |r) 



where 



c\k 

— -P(k) W(k\ mi ,m 2 ) K(kr), (28) 
2ir z 



W(k\mi,m2) = W[kR(mi)] W[kR(m 2 )], 

and the Ws are the Fourier transforms of tophat window 
functions, and 



K{x) = 



5r( 



sin x — a; cos x) 



© 0000 RAS, MNRAS 000, 000-000 



10 R. K. Sheth et al. 



1000 - 



B 



"Si 500 - 



SCDM 2 = 


ACDM 2 = 


- 


- 


°& 


- 




4&*% 








A*^A 




_ 


A ^ A 


x^ * \0 A 




x X " 


x y , x \ch 


" 


J*^- — ~v :"£ A 


\^ 


_ 


-* \ x-^V 


r *-£ 




j'"^"* 


'i x 


X^(V 


/ * 


/ 




/ \ 


/ * 


- 


' \ 


/ \ 




/ . 






~- \ 


h ^ ' \ 










. \ 


- — " \ 




\_ 


\ 











0.1 1 10 

r [Mpc/ft,] 



0.1 1 10 

r [Mpc/ft] 



Figure 5. Scale dependence of the pairwise velocity dispersion. 
Symbols, the same as in the previous figure, represent the simula- 
tion results. Solid curves show the scale dependence in our model 
after including the fact that the dispersion within a halo depends 
on position within the halo, and also accounting crudely for the 
effects of spatial as well as velocity correlations. The two dashed 
curves show the contribution from pairs in which both particles 
are in the same halo (peak at small r) and in separate haloes 
(peak at large r). 



This generalizes the expression in Gorski (1988), which as- 
sumed R(mi) — R(m 2 ). A simple way to include the peak 
constraint, at least approximately, is to multiply by the ap- 
propriate peak constraint factors: 

, T ,/ l \_ Chalo(mi) cr ha i (m 2 ) . . . , . 

w(mi,TOar)« j — {■ -. r ip(m 1 ,m 2 \r). (29) 

cr-i(mi) CT_i(m 2 ) 

This includes the fact that peaks have lower rms velocities 
than random patches, but assumes that the correlations are 
otherwise unchanged. 

At first sight, inserting this correlation term into equa- 
tion (E3j) appears to involve a triple integral — one over mi, 
one over m 2 , and one over k. In practice, it is much more 
efficient to rearrange the order of the integrals so that the 
integral over k is done last. The integrals over mi and m 2 
are separable and equal. If we use £(fc) to denote the re- 
sult of the integral over m of mn(rn) times the window 
function W[kR(m)], and Et(fc) to denote the integral of 
mn(m) b(m) W [kR(m)], then the remaining integral over k 
is really of the form J dk P(k) [£ 2 (fc) + Eg(fe)] K{kr)/2-K 2 . 
The result of all this is that 



S(mi,m 2 \r) 



o"vir(rrai) +o- vir (m 2 ) 
+0"Lio(mi) + o-h a i (m 2 ) 
6(mi)&(m 2 ) vl 2 (r) 



+ 



2[l + b{m 1 )b(m 2 )Zo(r)Y 



2<f(mi,ra 2 |r) 
(30) 



where the <rs are assumed to be the dispersions in one dimen- 
sion. The final term comes from the fact that the statistic is 



not centred. It was obtained by expanding {(vi 



v 2/ 



2 (1 + 



biSi)(l + b 2 S 2 )) in 5\ and 8 2 , and using the linear theory 
relation between v and S (e.g. Fisher 1995). 

Fig. y compares our model with what is measured in 
simulations. The symbols are the same as in the previous fig- 
ure; they are included to show the uncertainties associated 
with the measurements to date, and the influence of the fi- 



nite size of the simulation box. As before, we have accounted 
for the finite box size (which we set to L = 141 Mpc/h) when 
showing our model predictions. The two dashed curves are 
the model predictions for pairs in which both particles are 
in the same halo (peak at small r), and in different haloes 
(peak at large r). The solid curve is the sum in quadrature 
of the two contributions, plus a piece which comes from the 
fact that the statistic is not centred. 

In addition to including the effects of velocity correla- 
tions (which affects the two- halo term), we have modified 
the one-halo term to include the fact the velocity dispersion 
within a halo depends on position within the halo, as de- 
scribed in the Appendix. A glance at Fig. A2 shows that, 
for massive haloes, the isothermal assumption overestimates 
the dispersion. Because the peak is mainly due to pairs in 
massive haloes, using what is, arguably, the more realistic 
value for the dispersion lowers the height of the peak. With 
these two changes, our model falls significantly below the 
SCDM results. In the ACDM model, on the other hand, our 
model appears to be in reasonable agreement with the simu- 
lations, though it too underestimates the simulation results, 
especially on small scales. 

5.4 Dependence on trace-particle type 

Our model has the virtue that it is straightforward to study 
how the pairwise velocity dispersion depends on the type of 
trace particle. For example, if only haloes were used to con- 
struct this statistic, one would expect the large scale asymp- 
totic value of <7i 2 to be smaller than for the dark matter, 
since, in this case, the virial term does not contribute to 
the dispersion. Figure 1 of Sheth & Diaferio (2000) suggests 
that the term which remains depends slightly on halo mass, 
and that linear theory, smoothed on the appropriate scale, 
provides a reasonably good estimate of what it is. 

In addition to being smaller than the dark matter pair- 
wise dispersion, the scale dependence of <ri 2 (r-) for haloes 
will also be different than it is for dark matter. We can use 
the results above to estimate how it scales. For halos, there 
is no contribution from virial motions, so 



halos / 
0-12 ( 



where 



1 + S h 



dmi dm; 



1 + Chh(mi,m 2 |r) 



[1 + Ehh(r)] 

x n(mi)n(m 2 ) H(mi,m 2 \r) 



(31) 



dmi dm 2 n(mi) n(m 2 ) 



1 +£hh0m,ra 2 |r) 



and H(mi,m 2 \r) is given by equation (BOJ) with the virial 
terms set to zero. A more exact expression, appropriate for 
peaks identified with the same smoothing scale (mi = m 2 in 
our notation), has been provided by Regos & Szalay (1995): 
our approximate expression is very similar to their equa- 
tion (68). 

Equation ( |31[ ) shows that ai 2 loB at intermediate r 
should be slightly smaller than the asymptotic large r value, 
whereas the pairwise dispersion is larger at intermediate r 
for the dark matter. 

Galaxies in massive haloes are essentially trace parti- 
cles, so their motions are similar to the motions of dark 
matter particles (see Sheth & Diaferio 2000 for some sub- 
tleties associated with motions of galaxies in less massive 
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haloes). For the velocity dispersion statistic, the important 
difference between galaxies and dark matter particles is that, 
whereas the number of dark matter particles in a halo is pro- 
portional to halo mass, and the number of haloes in a halo 
is unity (of course!) for all haloes, the number of galaxies 
in a halo is something in between. Essentially, this happens 
because the number of galaxies in a halo is proportional not 
to the total amount of gas in the halo, but to the amount of 
gas which can cool. So the number of galaxies increases with 
halo mass, but not as quickly as the number of dark matter 
particles does. Thus, relative to the statistics of dark matter 
particles, galaxies downweight the contribution from mas- 
sive haloes. Since massive haloes have large virial motions, 
these are less pronounced for galaxies than for dark mat- 
ter. As a result, the pairwise dispersion of galaxies will be 
smaller in amplitude, and less scale-dependent, than that of 
the dark matter. This was first noticed by Jing, Mo & Borner 
(1998), who pointed out that this is what was required for 
consistency with observations. 

Peacock & Smith (2000) used a simple N ga ,i(m) pre- 
scription to generate galaxies from their dark matter simu- 
lations. They then measured the pairwise dispersion of the 
model galaxies in their simulations. Our analytic estimate of 
the pairwise dispersion of the dark matter allows us to do an- 
alytically what Peacock & Smith did numerically. First, we 
use the -/V ga i(m) relation to compute the correlation func- 
tion of galaxies. Essentially, this can be done by setting 
p(r\m) — > p(r\m) N ga ,i(m)/in in Section PJ (but see Seljak 
2000, Peacock & Smith 2000, or Scoccimarro et al. 2000 for 
some subtleties associated with how exactly this is done) . We 
then use this pair-weighting to compute the velocity disper- 
sion of our model galaxies. 

Specifically, we set 



'glhal 



„(r) = 



1 



dm 



Gm 



1 +£gal(r) J TVir 

<^ g 2 ai(m)) n (m) A(r|m) 



(32) 



"gal 



'gal 



where n ga i = J dmmn(m)N ga ,\(m)/m and £ ga i(r) is the 
galaxy correlation function (see e.g. Seljak 2000; Scocci- 
marro et al. 2000), and 



■"g2halo 



(0 



drnidm; 



JV ga i(mi) 
mi 



1 +£(mi,m 2 |r) 

[1+CgalW] 



lll 2 



min(mi) ra-2,n,{m-i) 



"sal 



"■gal 



S(mi,m 2 \r), (33) 



where S(rni,m2\r) is as in equation fed). 

To illustrate, Fig. @ shows the result of doing this using 
the iVgai(ra) relation obtained from the publically available 
GIF ACDM semi-analytic galaxy formation models of Kauff- 
mann et al. (1999). We actually used the simple fit to this 
relation, for galaxies brighter than Mb < —17.5 + 51og/t 
after correcting for the effects of dust, provided by Sheth & 
Diaferio (2000). The panel on the left shows the correlation 
functions of the dark matter (open circles) and the galax- 
ies (stars), and our models for the two correlation functions 
(solid lines). The bottom left panel shows the square root of 
the ratio of the two correlation functions. 

The open circles in the panel on the right show the 
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Figure 6. The correlation function and pairwise velocity disper- 
sion for simple models of galaxies. Open circles in both panels 
show the statistics of the dark matter in the ACDM GIF simula- 
tion, and stars show the corresponding statistics computed using 
the semianalytic bright, extinction corrected galaxies of Kauff- 
mann et al. (1999), measured in the same simulation. Solid curves 
show our model predictions. The two bottom panels show the 
square root of the ratio of the galaxy and dark matter correlation 
functions, and the ratio of the streaming motions and rms pair- 
wise velocities, respectively. Dashed curves in the top panels show 
our model predictions for the semianalytic models of Benson et 
al. (2000). 



mean streaming motions and the pairwise dispersions of the 
dark matter particles, and the stars show the corresponding 
statistics for the semianalytic galaxies. We assigned veloci- 
ties to the galaxies slightly differently than how Kauffmann 
et al. (1999) did. Namely, for the central galaxy in a halo, 
and for all haloes in which there was only one galaxy, we used 
the halo centre of mass velocity, rather than the velocity of 
the nearest particle to represent the motion of the galaxy. 
The difference between the two speeds is typically about 80 
km/s; this means that our values for the mean streaming 
and velocity dispersion statistics are slightly smaller than 
the ones presented in Fig. 13 of Kauffmann et al. (1999). 

The bottom right panel shows the ratio of the mean 
streaming motions of galaxies to that of the dark matter 
(filled circles) and the ratio of the rms pairwise velocity of 
galaxies to that of the dark matter (filled stars). Solid lines 
show our model predictions. We used the simpler isothermal 
approximation, rather than the actual position dependent 
dispersion, when computing the model predictions. Both in 
the simulations and in our model, the pairwise dispersion for 
the galaxies falls below that of the dark matter, although our 
model overestimates the amount by which this happens. The 
streaming motions of galaxies are similar to those of the dark 
matter on large scales because the galaxies have a bias factor 
which is close to unity. Presumably, if the large scale bias 
factor were different from unity, the streaming motions on 
large scales would also be different from the dark matter. On 
small scales, however, the galaxy mean streaming nrotions 
can be rather different from that of the dark matter; our 
model is able to provide quite a good description of this 
difference. 

Benson et al. (2000) showed that the pairwise disper- 
sions of galaxies in their semianalytic galaxy formation mod- 
els are lower than those of Kauffmann et al. (1999), and that 
the reason for this was because the two models have different 
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N ga \(m) relations. Therefore, we also looked at a model in 
which JVg a i(ra) = (ra/lO 12 ' 9 M //i) ' 6 for haloes more mas- 
sive than 3 x 10 11 M®/h; this provides a good fit to the Ben- 
son et al. (2000) models with M B < -19.5 +5 log ft, and is 
similar to the sort of scaling which Jing, Mo & Borner (1998) 
argued was required for agreement with observations. The 
dashed curves in the two upper panels of Fig. n show our 
model predictions when this relation is used. The correlation 
functions of the two semi-analytic models are similar despite 
the different brightness cuts. However, because this model 
has fewer galaxies in massive haloes than the Kauffmann 
et al. model, the pairwise dispersion of the galaxies in this 
model is lower. This is because, compared to the correlation 
function, the pairwise dispersion has an extra Gm/r v j r tx 
m 2 ' 3 dependence on halo mass, so it is that much more 
sensitive to the presence or absence of massive haloes. The 
Benson et al. models for galaxies with Mb < —18.5 + 5 log h 
are well fit by N gaX (m) = (m/10 12 ' 6 M©//i) a75 . If we use 
this relation instead, then the amplitude of <ri2 increases. 
This is consistent with what Benson et al. measured in their 
simulations. 



6 DISCUSSION 

Writing the correlation function as the sum of two terms, 
one which is essentially described by linear theory, and dom- 
inates on large scales, and another which is inherently non- 
linear, and dominates on small scales, is both accurate and 
useful. Such a split has been used to model the spatial dis- 
tribution of the dark matter and of galaxies (Seljak 2000; 
Peacock & Smith 2000; Scoccimarro et al. 2000). Sheth & 
Diaferio (2000) show that a similar split can be applied to 
velocities; particle velocities can also be written as the sum 
of linear and nonlinear parts. In this paper, we used this 
split to compute simple estimates not just of the correlation 
function of dark matter particles, but of the single particle 
velocity dispersion (the density weighted temperature) and 
its evolution, and the scale dependence of the first and sec- 
ond moments of the pairwise velocity distribution as well. 

Our model provides a good description of the scale de- 
pendence of the mean streaming velocities (Figs. H and pf). 
It also provides a reasonably good description of the scale 
dependence of the pairwise dispersion, provided one restricts 
attention to scales larger than a Megaparsec or so (Figs. H 
and rJ); our model appears to underestimate the value of 012 
on scales smaller than 1 Mpc/h. This is, perhaps, surprising, 
because on these smaller scales, most pairs are in the same 
halo, and one might have thought that virialized haloes were 
rather simple to model. Our model assumes that virialized 
haloes are smooth (or, more importantly, that substructure 
does not substantially affect the velocity dispersion), and 
that the velocity dispersion within them is isotropic. The 
discrepancy between our model and the simulations sug- 
gests that one or both of these assumptions is wrong. To 
model the small separation regime accurately, we may have 
to resort to using the exact approach based on the BBGKY 
hierarchy (e.g. Peebles 1980). As we mentioned in the in- 
troduction, this approach is considerably more complicated, 
because it requires knowledge of the three-point correlation 
function. Although this can be done within the context of 



these halo models using results presented in Scoccimarro et 
al. (2000), we have not done so here. 

Despite the shortcomings of our model for the pairwise 
dispersion on small scales, we feel that there are at least two 
reasons why it is useful. First, it is much simpler than the 
exact approach one is led to from the BBGKY hierarchy, 
for which knowledge of the three-point correlation function 
is required. Secondly, it is easily extended to provide predic- 
tions as a function of trace particle type. This is particularly 
useful for compar ing theoretical models with observations of 
galaxies. Section 5.4 showed how to model the dependence 
on separation of the galaxy pairwise velocity dispersion. 

The main reason for doing this was the following. Our 
model for the linear and nonlinear contributions to the num- 
ber of pairs at a given separation can be combined with 
Sheth & Diaferio's model for the linear and nonlinear con- 
tributions to velocities, to model how the full distribution of 
pairwise velocities (not just the first and second moments) 
depend on pair separation. Such a model can be extended 
to describe galaxies, just as we did here. Following Fisher 
(1995), this allows us to estimate the effect of redshift space 
distortions on the shape of the galaxy correlation function 
over the entire range of linear to nonlinear scales. This is the 
subject of work in progress. 
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APPENDIX A: HALO PROFILES 

We study a number of different halo profiles below. For each 
profile, we will be interested in how the density p, the cir- 
cular velocity v c , the radial velocity dispersion of, an d the 
potential 4> depend on distance from the halo centre. In ad- 
dition, we will be interested in the convolution of a profile 
with itself: A. 

The circular velocity at a distance r from the centre of 
a halo is defined as the square root of the ratio of the mass 
interior to r and r: 



v c (r\m) = = / Ax x p(x\m) 

r r Jo 

This, averaged over the profile shape, is 
mV c (m) = 47r / dr r p(r\m)v c (r\m). 
The potential energy at r is 



(Al) 



(A2) 



(r\rn) — 



-4ttG* 



dr r p(r \m) 



>o 
-4ttG / dr' r' p(r'\m). (A3) 

J r 

The total potential energy is this, integrated over the halo: 

W(m) — 2tt / dr r 2 p(r\m) <j>(r\m) - -raV c 2 (m), (A4) 



where the final equality follows after rearranging the order 
of the integrals. If a halo is assumed to be in equilibrium, 
and the orbits within it are assumed to be isotropic, then the 
Jeans equation can be used to compute the radial velocity 
dispersion a 2 : 



A pa 2 , , . dd> , , . Gm(< r) 
- -££. = p (r\m) -g = p{r\m) ^-L. (A5) 

The total kinetic energy of the halo is 

rw \ 3 , f , 2 , i \ 2, i \ 3 mV c (m) ,. „, 

K(m) = - 4ty dr r p(r\m)a r (r\m) — — -. (Ad) 

2 J 2 3 

These relations between W, K and the circular velocity are 
true for all profiles, and will be a useful consistency check in 
what follows. Notice that —W(m) — 2K(m): the haloes are 
in virial equilibrium. 



Al The Hernquist profile 

The mass associated with this profile is finite even though 
the halo extends smoothly to infinity. For this reason, it will 
be a very useful benchmark calculation in what follows. For 
a halo of mass m at r v i r , the profile is 

p H (s) A vir 26(l + &) 2 A vir 2(l + 6) 2 



3f2 s{b + s) 



m 6 3 a;(l-|-x) 3 



(A7) 



where A v i r is the average density within r v i r in units of the 
critical density, s — r/r v i r , b is a core radius in units of r v i r , 
and x = s/b. The mass interior to r is given by 



m(<r) _ (1+bfx 2 



m (1 + x) 2 

the circular velocity is 

Gm(<r) Gmx(l + b) 2 



v c (r) - 



r vir &(l + x) 2 ' 



(A8) 



(A9) 



and the radial velocity dispersion <r 2 , which can be computed 
from the Jeans equation, is also analytic: 



ggfrj 

Gm/r v i T 



a+b) 2 



12b 



12«(l + ») 3 ln(i±^) 



25a; + 52x 2 



42a; 3 + 12a- 4 



1 +x 



(A10) 



(Hernquist 1990; Cole & Lacey 1996). The potential energy 
at r of such a halo is 



<Kr) 



Gm (1 + fe) 2 



r V ir 6(1+ a;) 
and so the total potential energy of the halo is 



W K 



4tt 



dr r pn(r) <f)(r) 



Gm 2 (1 + b) 4 
6r v i r b 



(All) 



(A12) 



(Hernquist 1990). It is straightforward to verify that the ki- 
netic energy Kh = —Wn/2: the halo is in virial equilibrium. 
A similar averaging of v 2 (r) equals —Wn. 

The convolution of such a profile with itself is 



(A13) 



47rr 3 ir b z 


X-a(r\m,b) 4 hi(x) - h 2 {x) 


in 2 


(1 + 6) 4 a 4 (2 + x) 4 ' 


hi(x) 


24 + 60a; + 56a 2 + 24a; 3 + 6a 4 + a 5 


1 +x 


h 2 (x) 


12 (1 + x) (2 + 2x + x 2 ) ln(l + a) 



and 



A little algebra shows that 2nG J dr r An(r|m, b) equals the 
potential energy of the halo, as it should. 
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A2 The truncated singular isothermal sphere 

In this case a halo of mass m is truncated at its virial radius 
TVir. On scales smaller than this, 

p(r) _ A vir /3n 

— 9 ' 

p s 2 

where s — r/r v i r , and A v i r is a constant which specifies how 
dense the halo is relative to the critical density at the time: 
3m/47rrJ ir = A v i r p C rit- The circular velocity within the halo 

is 



(A14) 



= Gm/r V i T , 



(A15) 



and the radial velocity dispersion within the halo is a 2 = 
f c /2; for an isothermal sphere, v c and a r are independent of 
position within the halo. The convolution of such a profile 
with itself is 



4%r: 



— \i so (r\m) = ^ I — In 
/a x 



if < s < 2, (A16) 



and the truncation means that it is zero on separations larger 
than twice r v i r . So, for a truncated isothermal sphere 

■2-C; I dr r\ lso (r\m,b) = ^-; (A17) 



this equals the circular velocity averaged over the halo, 
which in turn equals 2_K'i so = — Wj bo . 



A3 The truncated NFW profile 

We could go through a similar exercise for a truncated Hern- 
quist profile. Instead, we will study another profile which 
declines slightly less steeply at the edge, and so is able to fit 
the results of numerical simulations slightly better (Navarro, 
Frenk & White 1997). The NFW profile contains mass m 
within r v i r , and it is truncated at this scale. Within the 
virial radius, 



A vir 



/(«) 



P(r) _ 
p " 3fi a 3 x{l + x) 2 



(A18) 



where x — r/a, with s = r/r v i r as before, a is the core radius 
in units of r v [ T , and 



/(<*) = 



ln(l + l/o) - 1/(1 + a) 



The potential at r is 

Gm f(a) [ln(l + a:) 



1+a 



(A19) 



(A20) 



The corresponding expression in Cole & Lacey (1996) does 
not have the second term in the square brackets because 
we are assuming the halo is truncated at the virial radius, 
whereas they did not. The circular velocity is 



v c (r) 



Gm f(a) 



ln(l + a;) 



1 + x 



(A21) 



and the total potential energy is 
Gm 2 f(a) 2 



Wnfw 



2r\,ir a(l + a) 



l-2aln(l + l/a) + 



a 



1 + a 



.(A22) 



for the truncated potential given above. This equals the 
average of the circular velocity over the halo. If the halo 



is not truncated (equation A2C without the second term 



in the square brackets), then the total potential energy is 
Wnfw = — (Gm 2 /r v i r ) f 2 (a)/2 if we integrate over all r, 
and it is 

Wnfw - - ^- f} a } , [1 - a ln(l + l/o)] , (A23) 

2 r v ir a(l + a) 

if we integrate out to r v i r only. 

The radial velocity dispersion, computed from the Jeans 
equation, is 

a 2 {r) _ f(a) 2 x(l + x) 2 



Gm/r v ir 
where 


2a f(a) 


g(x) = 


1 1 

x + {1+x) 2 



ff(l/o) ^ 9{x) 



6 , x 

+ ^— — +ln 



(A24) 



1 + x 



1 + x 



+- 



6x 2 + 3a; - 1 



x 2 {l + x) 
the dilogarithm Li2(x) is defined by 



ln(l + x) - 31n 2 (l + x)+ 6Li 2 (x) 



Li 2 (a;) 



din 2 ln(l + z). 



This, in the expression for the total kinetic energy shows 
explicitly that — VKnfw = 2A'nfw- This relation is satisfied 
exactly because we truncated the halo at the virial radius. 
If we do not truncate, and we let the halo extend to infinity, 
then the expression above should be modified by setting 
g(l/a) - tt 2 - 1. 

The convolution of such a profile with itself gives 

47rr vir a 3 A N Fw(r|m, a) -4(1 + a) + 2ax(l + 2a) + a 2 x 2 



/W 2 



1 , 

H — =-ln 



2x 2 (l + a) 2 (2 + x) 
(1 + a — ax)(l + x) 



(1 + a) 



ln(l+aO , 
H — ,\ si if < s < 1 



^(2 + : 



ln[(l + a)/(ox + a - 1)] 
x(2 + xf 
if 1 < s < 2. 



+ 



a x — 2a 



2x(l + a) 2 (2 + x) 



(A25) 



In the limit in which the virial radius is much larger than 
the core radius, this separates into the product of a function 
of a and another of x: 



ANFw(f|m, a) 



m 2 f(a) 2 2 

47rr^ ir a 3 x 2 (x + 2) 

(x 2 + 2a; + 2) ln(l + x) 



x 



x(x + 2) 



Straightforward but tedious algebra shows that the integral 
of 2-rGt 2 Anfw(?" \m, a)/r equals Wnfw, the average of the 
potential over the halo. Again, this equality is exactly sat- 
isfied only becaus e we hav e sel f-consistently truncated our 
haloes (equations A25 and A22). 



It is an interesting question as to whether or not trunca- 
tion is important. Although they discuss NFW haloes trun- 
cated at the virial radius (so they have finite mass) it ap- 
pears that Cole & Lacey (1996) do not truncate their NFW 
haloes when computing <f> and ay. As a result, their values 
of ay do not go to zero at the virial radius, and their values 
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Figure Al. Shape of the convolution of the density profile as a function of distance in units of the virial radius for a few representative 
values of the core radius a. Small values of a (left panel) correspond to low mass haloes, whereas more massive haloes have larger core 
radii (right panel). Solid curves show the result for NFW haloes truncated at the virial radius (lower amplitude at large r) and when the 
haloes are allowed to extend to infinity (larger amplitude at large r), and dashed curves show the corresponding result for a Hernquist 
profile which contains the same mass within the virial radius. The core radius of the Hernquist profile is %/2aP' 7r> . 
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Figure A2. Radial velocity dispersion as a function of distance from halo centre if orbits are isotropic for the same models as the 
previous figure. Solid curves show the dispersion in NFW haloes truncated at the virial radius, dotted curves show the isothermal value 
(which is independent of distance from centre) and dashed curves show the result for the corresponding Hernquist profile. 



of Wnfw/27\nfw at the virial radius are slightly greater 
than unity. Both these are in agreement with the simula- 
tion results they present. Since NFW haloes are not really 
isolated objects, it may be that Cole & Lacey's decision to 
ignore the truncation at the virial radius when computing 
all quantities except the mass is more physically reasonable. 

Deciding whether to truncate or not is important if one 
wishes to include the scale dependence of the velocity dis- 
persion within a halo into our model for the pairwise velocity 
dispersion. We found that using our formula for truncated 
NFW haloes, and then integrating over the distribution of 
halo masses produced values of an which were about 15% 
lower than the simulation results presented in Fig. H. If we 



do not truncate, our models are better able to reproduce the 
measurements in simulations, suggesting that this is, indeed, 
the correct thing to do. 



The solid curves in Fig. A2 show the radial velocity 



disp ersio n as a function of radius for truncated NFW haloes 
(eq. A24 ) for three representative values of the concentration 



parameter a; panels on the left correspond to the least mas- 
sive haloes. The dotted curves show the isothermal value: 
one half of GM/r,i„ and dashed curves show the corre- 
sponding result for an infinite Hernquist profile, with core 
radius b = \/2a ' 75 . With this scaling, the Hernquist for- 
mula for the dispersion provides a good approximation to 
what happens if we remove the condition that the NFW 
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profile is truncated. This is useful because the Hernquist 
radial velocity profile is analytic. 
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